The Ilp Description Learning Problem: towards a General Model-level Deenition of Data Mining in Ilp
نویسنده
چکیده
The task of discovering interesting regularities in (large) sets of data (data mining, knowledge discovery) has recently met with increased interest in Machine Learning in general and in Inductive Logic Programming (ILP) in particular. However, while there is a widely accepted deenition for the task of concept learning from examples in ILP, deenitions for the data mining task have been proposed only recently. In this paper, we examine these so-called "non-monotonic semantics" deenitions and show that non-monotonicity is only an incidental property of the data mining learning task, and that this task makes perfect sense without such an assumption. We therefore introduce and deene a generalized deenition of the data mining task called the ILP description learning problem and discuss its properties and relation to the traditional concept learning (prediction) learning problem. Since our characterization is entirely on the level of models, the deenition applies independently of the chosen hypothesis language.
منابع مشابه
The Construct of Interlanguage Pragmatic Learning Strategies: Investigating Preferences of High vs. Low Pragmatic Performers
Interlanguage pragmatics (ILP) has witnessed a growing body of research in the past two decades. One of the under-explored domains of L2 pragmatics is the role of learning strategies specifically tailored for the development of ILP knowledge. Therefore, this investigation aimed to determine the significant interlanguage pragmatic learning strategies (IPLS) used by high vs. low L2 pragmatic achi...
متن کاملA New ILP Model for Identical Parallel-Machine Scheduling with Family Setup Times Minimizing the Total Weighted Flow Time by a Genetic Algorithm
This paper presents a novel, integer-linear programming (ILP) model for an identical parallel-machine scheduling problem with family setup times that minimizes the total weighted flow time (TWFT). Some researchers have addressed parallel-machine scheduling problems in the literature over the last three decades. However, the existing studies have been limited to the research of independent jobs,...
متن کاملFrequent query discovery: a unifying ILP approach to association rule mining
Discovery of frequent patterns has been studied in a variety of data mining (DM) settings. In its simplest form, known from association rule mining, the task is to nd all frequent itemsets, i.e., to list all combinations of items that are found in a suucient number of examples. A similar task in spirit, but at the opposite end of the complexity scale, is the Inductive Logic Programming (ILP) ap...
متن کاملInductive Logic Programming for Data Mining in Economics
This paper addresses the problem of data mining in Inductive Logic Programming (ILP) motivated by its application in the domain of economics. ILP systems have been largely applied to data mining classification tasks with a considerable success. The use of ILP systems in regression tasks has been far less successful. Current systems have very limited numerical reasoning capabilities, which limit...
متن کاملA Comparative Study of Inductive Logic Programming Methods for Software Fault Prediction
We evaluate inductive logic programming (ILP) methods for predicting fault density in C++ classes. In this problem, each training example is a C++ class deenition, represented as a calling tree, and labeled as \pos-itive" ii faults (i.e., errors) were discovered in its implementation. We compare two ILP systems, FOIL and FLIPPER, and explore the reasons for their diiering performance, using bot...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995